Abnormality handling determination program, method for abnormality handling determination, and administration device

ABSTRACT

A non-transitory computer readable storage medium storing therein an abnormality handling determination program that causes a computer to execute a process, the process includes, acquiring state information of appointed items regarding to a state of each of a plurality of devices in a system depending on a detection of abnormal information of a first device among the plurality of devices, judging whether the state information of the first device deviates from the distribution range of the state information calculated by the state information of the devices except the first device for every item, and determining a handling for the first device based on a result of the judgment.

CROSS-REFERENCE TO RELATED APPLICATION

This application is based upon and claims the benefit of priority of theprior Japanese Patent Application No. 2015-160179, filed on Aug. 14,2015, the entire contents of which are incorporated herein by reference.

FIELD

The embodiments discussed herein are related to an abnormality handlingdetermination program, a method for abnormality handling determinationand an administration device.

BACKGROUND

The storage system is equipped with a plurality of disk devices, andworks as a large-capacity storage. The information processing deviceaccesses a plurality of disk devices through a network, etc.

When the state administration device in the storage system detects, forexample, a trouble sign of the disk device, the device performs specifichandling for the storage device to raise a reliability (a difficulty oflosing data). For example, when the state administration device detectsthe trouble sign of the disk device, the state administration devicestarts the mirroring with other disk devices and separates the diskdevice that the trouble sign was detected. The state administrationdevice in this way avoids the loosing of the data which is memorizedwhen the trouble really occurs to the disk device.

For example, the technique about the system with a plurality of diskdevices is listed in patent documents 1 and 2.

CITATION LIST Patent Document

[Patent document 1] Japanese Laid-Open Patent Publication No. 2008-52752

[Patent document 2] Japanese Laid-Open Patent Publication No.2006-164304

SUMMARY

However, the trouble sign may not accord with a state of the diskdevice. Therefore, when the device performs specific handling dependingon the detection of the trouble sign, there may be a case of handlingfor the factor that is different from the factor of the trouble of thedisk device or a case of performing the handling even though the troubledoes not occur.

By performing the handling that does not accord with the real state ofthe device, the replace of the disk device become frequent, and costincreases.

According to an aspect of the embodiments, a non-transitory computerreadable storage medium storing therein an abnormality handlingdetermination program that causes a computer to execute a process, theprocess includes acquiring state information of appointed itemsregarding to a state of each of a plurality of devices in a systemdepending on a detection of abnormal information of a first device amongthe plurality of devices, judging whether the state information of thefirst device deviates from the distribution range of the stateinformation calculated by the state information of the devices exceptthe first device for every item, and determining a handling for thefirst device based on a result of the judgment.

The object and advantages of the invention will be realized and attainedby means of the elements and combinations particularly pointed out inthe claims.

It is to be understood that both the foregoing general description andthe following detailed description are exemplary and explanatory and arenot restrictive of the invention.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a diagram indicating an example of the storage systemaccording to an embodiment.

FIG. 2 is a diagram of flow chart explaining a flow of handling for thedisk device depending on detection of the trouble sign of the diskdevice according to the comparison example.

FIG. 3 is a diagram of flow chart explaining a flow of the processing ofstate administration devices according to the embodiment.

FIG. 4 is a diagram of hardware constitution of state administrationdevice (information processing device) according to the embodiment.

FIG. 5 is a diagram of constitution of the software block of the stateadministration device 100 depicted in FIG. 4.

FIG. 6 is a diagram indicating an example of the state information table150 depicted in FIG. 4 and FIG. 5.

FIG. 7 is a diagram explaining an example of the state information thatthe state monitoring module 141 explained in FIG. 5 acquires.

FIG. 8 is a diagram indicating an example of state/use mappinginformation 160 explained in FIG. 4 and FIG. 5.

FIG. 9 is a diagram of flow chart explaining a flow of the processing ofabnormality handling decision program 140 explained in FIG. 4 and FIG.5.

FIG. 10 is a diagram explaining the first example of the processing inflow chart of FIG. 9.

FIG. 11 is a diagram explaining the second example of the processing offlow chart in FIG. 9.

FIG. 12 is a diagram explaining the third example of the processing inflow chart of FIG. 9.

FIG. 13 is a diagram explaining the fourth example of the processing offlow chart in FIG. 9.

FIG. 14 is a diagram of flow chart explaining the details of theprocessing of process S41 in flow chart of FIG. 9.

FIG. 15 is a diagram explaining value “t effective calculation time” andvalue “t acquisition time”.

FIG. 16 is a diagram indicating an example of the storage systemaccording to the other embodiment.

FIG. 17 is a diagram indicating an example of the hardware constitutionof administration device 200 depicted in FIG. 16.

FIG. 18 is a diagram indicating an example of the software block of thestorage system depicted in FIG. 16.

FIG. 19 is a diagram indicating an example of state information table250 according to the other embodiment.

DESCRIPTION OF EMBODIMENTS

Embodiments will be described hereinafter according to the drawings.However, it is noted that the technical scope is not limited to theembodiments described below, but covers the matters described in theclaims and the equivalents thereof.

Storage System

FIG. 1 is a diagram indicating an example of the storage systemaccording to an embodiment. As illustrated in FIG. 1, the storage systemcomprises disk device group 105. The disk device group 105 has diskdevices “dd1”-“ddn” (below also called as disk device “dd”). Inaddition, each of disk devices “dd” connects with a CPU (not illustratedin FIG. 1) or other information processing devices (not illustrated inFIG. 1) through a disk interface unit 104, etc.

For example, the disk device group 105 depicted in FIG. 1 is consistedof disk array. The disk array indicates a technique to use a pluralityof disk devices “dd1”-“ddn” as one large-capacity disk device. Theadministration method of data in the disk array includes RAID (RedundantArrays of Inexpensive Disk devices), for example. The RAID technique isto operate as one virtual disk device by putting a plurality of diskdevices “dd” together, thereby increasing a redundant.

A different redundant degree is settable for each of disk array each.For example, the redundant degrees include RAID0, RAID1, RAID3, andRAID5, etc. The RAID0 indicates a method to separate single data and tostore it to a plurality of disk devices “dd” with a distribution manner,and is called as a striping. According to the RAID0, it is possible toaccess data for the data which is stored in distribution manner inparallel although there is not the redundancy of the data.

The RAID1 indicates a method to write the same data in a plurality ofdisk devices “dd” and is called a mirroring. The RAID1 has theredundancy of data since a plurality of disk devices “dd” hold the samedata. Therefore, it is possible to use memorized data for other diskdevice “dd” even if a faulty occurs in one disk device “dd”.

The RAID3, RAID5 have high more redundant degree for the RAID1. TheRAID3 is a method to divide data and write it in a plurality of diskdevices “dd” and store parity generated based on the data which isdivided into another disk device “dd”. In addition, the RAID5 indicatesa method to store both of data and parity in a plurality of disk devices“dd” with a distributed manner.

Comparison Example

FIG. 2 is a diagram of flow chart explaining a flow of handling for thedisk device depending on detection of the trouble sign of the diskdevice according to the comparison example.

S11: A state administration device according to the comparison exampledetects the abnormal information (also called as the trouble sign asfollows) of a certain disk device among disk device group depicted inFIG. 1. The state administration device detects the trouble sign basedon a comparison of a predetermined value determined beforehand and thevalue of each disk device, for example.

For example, the state administration device detects the trouble signwhen the accumulation operation time for disk device reaches at lifetime (predetermined value) set depending on the specification of thedisk device beforehand. Or, for example, the state administration devicedetects the trouble sign when the error number of times of the diskdevice reaches the predetermined value beforehand.

S12: The state administration device carries out a fixed handling whichis determined beforehand for the disk device that the trouble sign wasdetected. For example, the state administration device starts themirroring which writes same data in both of the disk device that thetrouble sign was detected and the disk device which is different fromthe disk device that the trouble sign was detected then separates thedisk device that the trouble sign was detected, as the handling. Inanother form, the state administration device changes the use of thedisk device that the trouble sign was detected for backup, as thehandling.

In this way, the state administration device performs the handling whichis determined beforehand when the device detects the trouble sign. Whentrouble really occurred to the disk device that the trouble sign wasdetected, it restrains that memorized data are lost and the reliability(a hardly losing) of data is maintained.

However, the trouble sign does not always to accord with the state ofthe disk device. For example, the disk device may work normally even ifthe trouble sign is detected based on the accumulation operation time.Or the state of the disk device may turn worse than an assumption whenthe trouble sign is detected.

In addition, the disk device may work normally when an error is a slighterror, even if the trouble sign is detected based on the error number oftimes. Or the state of the disk device may turn worse than an assumptionwhen the trouble sign was detected when the error is a serious error.

Accordingly, a fixed handling which performs depending on the detectionof the trouble sign may not accord with the state of the disk device andmay not be appropriate handling. By performing an inappropriatehandling, though the state of the disk device works normally, a diskdevice may become a target of the exchange early. Or there is a casethat the state of the disk device turns worse or a case which is notable to improve though there may be a possibility of improvement of thestate of the disk device.

In this way, exchange of the disk device becomes frequent, and cost mayincrease by performing specific handling depending on detection of thetrouble sign to improve the reliability of data. Appropriate handlingfor the disk device is different depending on a state to a real diskdevice. Therefore, it is desirable that handling depending on the realstate of the disk device is performed when trouble sign is detected.

Summary of Embodiment

Therefore, the state administration device according to the embodimentacquires the state information of the appointed item about the state ofeach of a plurality of devices depending on detection of the abnormalityinformation of the first device among the plurality of devices (diskdevice “dd”) included in the system. And the state administration devicejudges whether or not the state information of the first device deviatesfrom the distribution range of the state information calculated by thestate information of the device except the first device for theappointed item and decides the handling for the first device based on ajudgment result.

For example, the system indicates a storage system depicted in FIG. 1.The state information is information about the states of disk device“dd”. For example, the appointed item (also called as the item)according to the embodiment is either or combination of access load forthe disk device “dd”, temperature of the disk device “dd”, the number ofthe abnormal parts (bad sector) of disk device “dd”, and the operationtime of disk device “dd”.

In addition, the embodiment exemplifies the case that a device is diskdevice “dd”, but it is not a thing limited to this example. A device maybe an information processor, a wireless device, or machinery included ina factory. When a device is a device unlike the disk device “dd”, theitem may be different information about the states of the device (CPUutilization rate or delay information, etc.).

FIG. 3 is a diagram of flow chart explaining a flow of the processing ofstate administration devices according to the embodiment.

S21: The state administration device acquires the state information forevery item of a plurality of disk device “dd” including the first devicedepending on the detection of the abnormal information (trouble sign) ofa certain disk device “dd” (the first device). For example, the stateadministration device acquires state information of the disk devices“dd1”-“ddn” depending on the detection of the trouble sign of the diskdevice “dd2” among the disk device group 105 (referring to FIG. 1).

As similar to a comparison example, the state administration devicedetects trouble sign based on life or the error number of times. Forexample, the state administration device detects trouble sign when theoperation time of the disk device “dd” reaches the life determined basedon specification of disk device “dd”. Or, for example, the stateadministration device detects trouble sign when the error number oftimes of disk device “dd” is over the predetermined value which isdetermined beforehand.

S22: The state administration device judges whether the stateinformation of the first disk device “dd” deviates from the distributionrange of the state information calculated by the state information ofdisk device devices “dd” except the first disk device, based on stateinformation of a plurality of disk devices “dd” which is acquired forevery item. In other words, the state administration device judges, itemby item, whether the state information of the disk device “dd2” deviatesfrom the distribution range of the state information of the disk devices“dd1”, “dd3”-“ddn”.

S23: The state administration device decides the handling for the firstdisk device “dd” (disk device “dd2”) based on a judgment result. Forexample, the state administration device decides the handling for thedisk device “dd2” based on the item where a value of the stateinformation of the disk device “dd2” deviates from the distributionrange of the state information calculated by the state information ofthe disk devices “dd1”, “dd3”-“ddn”. Or the state administration devicedecides the handling indicating continuously using for the disk device“dd2” when there is not an item deviating.

In this way, it is possible that the state administration deviceaccording to the embodiment decides appropriate handling based on acomparison of the state information with other disk device “dd”, notperforming a fixed handling, depending on the trouble sign. Thereby itis possible that the state administration device decides the handlingaccording to a state of disk device “dd” even if the trouble sign doesnot accord with a state of real disk device “dd”.

Therefore, it is possible that the state administration device decidesappropriate handling depending on the state concerned when an abnormalstate really occurs in the disk device “dd”. Thereby it is possible thatthe state administration device realizes the improvement of the state ofdisk device “dd” and restrains the life aggravation of the disk device“dd”. In addition, it is possible that the state administration devicedecides appropriate handling indicating the use of continuation withoutperforming a change for disk device “dd” when an abnormal state does notoccur in the disk device “dd”.

In this way, it is possible that the state administration device holdsexchange frequency of disk device “dd” while improving the reliabilityof data, because it is restrained to limit a use of disk device “dd”.Therefore, it is possible that the state administration device decreasesthe cost while improving the reliability of data.

In addition, the normal distribution range (the standard value) of thestate information of each item of the disk device “dd” varies dependingon the access frequency for the disk device “dd” or duties orenvironment to use, for example. Therefore, about each item of diskdevice “dd”, it is not easy to calculate the standard value of the stateinformation.

In contrast, the state administration device according to the embodimentjudges whether the state information of the disk device “dd” wheretrouble sign was detected is the value that deviates from thedistribution range of the state information calculated by the stateinformation of other disk device “dd”. And the state administrationdevice judges whether an abnormal state occurs in the disk device “dd”where the trouble sign was detected based on a judgment result.Accordingly, it is possible that the state administration deviceaccording to the embodiment judges an item of the state informationindicating the abnormal state easily without setting the standard valueof the state information of each item for every disk device.

Then, according to FIG. 4, the hardware constitution of the stateadministration device according to the embodiment and according to FIG.5, software block diagram of the state administration device in FIG. 4will be described.

Hardware Constitution of State Administration Device

FIG. 4 is a diagram of hardware constitution of state administrationdevice (information processing device) according to the embodiment. Thestate administration device 100 includes a CPU (Central Processing Unit)101, a memory 102 having a main memory 110 and an auxiliary memory 111,etc., a communication interface unit 103, a disk interface unit 104, andthe disk device group 105. The all parts are connected through a bus 106mutually.

The CPU 101 is connected to the memory 102, etc. through the bus 106 andcontrols the whole state administration device 100. The communicationinterface unit 103 is connected to other apparatuses (not illustrated inFIG. 4) through the Internet, etc. and transmits and receives data.

The disk device group 105 has a plurality of disk devices “dd” asdepicted in FIG. 1. The disk interface unit 104 is connected to the diskdevice group 105 and performs control processing of each disk device“dd” in the disk device group 105.

The disk device group 105 provides the area based on the RAID asillustrated by FIG. 1. The disk interface unit 104 performs the writingof data in the disk device “dd” and reading of data memorized in thedisk device “dd” in response to the command accepted from the CPU 101,etc. In addition, the disk interface unit 104 performs setting of RAIDof mirroring or striping for the disk device “dd” and separating andcontrolling of offline or online in response to the command acceptedfrom the CPU 101.

The main memory 110 includes a RAM (Random Access Memory) memorizes thedata which the CPU 101 processes. The auxiliary memory 111 isconstructed by an HDD (Hard disk device drive) or a nonvolatilesemiconductor memory.

The auxiliary memory 111 has a domain (not illustrated in FIG. 4) whichstores the operation system which CPU 101 carries out and a domain (notillustrated in FIG. 4) which stores a program using the disk device“dd”. In addition, the auxiliary memory 111 has storage control programstorage domain 130, state information table storage domain 150,state/use mapping information storage domain 160. In addition, thestorage control program storage domain 130 has abnormality handlingdetermination (decision) program storage domain 140.

The storage control program (below called as storage control program130) in the storage control program storage domain 130 realizes anaccess control for the disk device group 105 and a storageadministration processing including a setting of RAID by execution ofthe CPU 101.

The abnormal handling decision program (below called as abnormalityhandling decision program 140) in the abnormality handling decisionprogram storage domain 140 realizes a decision processing of thehandling for the disk device “dd” concerned depending on the detectionof the trouble sign of the disk device “dd” by execution of the CPU 101.

A state information table (below called as state information table 150)in the state information table storage domain 150 is a table that theabnormality handling decision program 140 accesses and has stateinformation of a plurality of disk devices “dd”. The details of thestate information table 150 will be mentioned later according to FIG. 6.

A state/use mapping information (below called as state/use mappinginformation 160) in the state/use mapping information storage domain 160is the information that the abnormality handling decision program 140refers to and has correspondence information between the state and useof the disk device “dd”. The details of the state/use mappinginformation 160 will be mentioned later according to FIG. 8.

Software Block of State Administration Device 100

FIG. 5 is a diagram of constitution of the software block of the stateadministration device 100 depicted in FIG. 4. As illustrated in FIG. 5,the storage control program 130 has an abnormality handling decisionprogram 140, a data processing module 131, and a constitutionadministration module 132. The abnormality handling decision program 140has a state monitoring module 141 and a use choice module 142.

The state monitoring module 141 acquires state information of each diskdevice “dd” regularly (for example, ten minutes interval) and memorizesit to the state information table 150. In addition, the state monitoringmodule 141 holds the life or the error number of times of each diskdevice “dd” and detects the trouble sign, for example. And the statemonitoring module 141 acquires the state information of each disk device“dd” depending on the detection of the trouble sign and memorizes itinto the state information table 150 and notifies the use choice module142 of outbreak of the trouble sign.

When the use choice module 142 receives the notice indicating thedetection of the trouble sign from the state monitoring module 141, theuse choice module 142 refers to the state information table 150. And theuse choice module 142 judges, item by item, whether or not the stateinformation of disk device “dd” that the trouble sign was detecteddeviates from the distribution range of the state information calculatedby the state information of other disk devices “dd”. And the use choicemodule 142 refers to the state/use mapping information 160 based on thejudgment result, decides the handling for the disk device “dd” wheretrouble sign was detected, and requests the constitution administrationmodule 132 for execution of the handling.

The data processing module 131 instructs the access processing for thedisk device “dd” to the disk interface unit 104 in response to thecommand to instruct the access of data. The access processing includesthe writing data to the disk device “dd” and reading of data from thedisk device “dd”.

In addition, the constitution administration module 132 manages theinformation about the constitution of each disk device “dd”. Forexample, the information about the constitution of disk device “dd”includes information such as the group setting indicating theadministration unit of RAID or the setting of the redundant degree ofthe group. In addition, the constitution administration module 132carries out the handling (for example, change of the redundant degree ofthe group, use change to backup use, separate, change to near-online)depending on a request of the handling from the use choice module 142.

State Information Table 150

FIG. 6 is a diagram indicating an example of the state information table150 depicted in FIG. 4 and FIG. 5. For example, the state informationtable 150 depicted in FIG. 6 stores the items of the state informationincluding an item “disk ID (IDentify)”, an item “access load (IO/sec)”,an item “number of bad sectors”, an item “temperature” (Celsius), and anitem “operation time (sec)”. But the item is not a thing limited to anexample depicted in FIG. 6. The state information table 150 may have anitem of other state information. In addition, the item may be one or aplurality of number.

The item “disk ID” is information to distinguish the disk device “dd”.The item “access load (IO/sec)” indicates quantity of the accessprocessing for the disk device “dd” per one second. The item “number ofbad sectors” indicates the number of bad sectors among the sectors whichthe disk device “dd” has. The item “temperature” (Celsius) indicatestemperature of the disk device “dd”. The item “operation time (sec)”indicates total time of the operation time of the disk device “dd”.

According to the example of FIG. 6, for the disk ID “001”, the accessload is “6422 (IO/sec)”, the number of bad sectors is “3”, thetemperature is “32” (Celsius), and the operation time is “31536324(sec)”. In addition, for the disk ID “002”, the access load is “35240(IO/sec)”, the number of bad sectors is “2”, the temperature is “33”(Celsius), and the operation time is “31577324 (sec)”. The stateinformation table 150 has similar stare information of each item aboutthe disk device “dd” of other disk ID.

FIG. 7 is a diagram explaining an example of the state information thatthe state monitoring module 141 explained in FIG. 5 acquires. Asmentioned above in FIG. 5, the state monitoring module 141 acquiresstate information of each disk device “dd”, for example, every intervalfor ten minutes and memorizes it to the state information table 150.FIG. 7 represents an example of the state information at the time“2015-03-30 10:00:00” and at the time “2015-03-30 10:10:00”. In thisway, the state information table 150 maintains the state information ofeach disk device “dd” to indicate in FIG. 1 every interval at time.

State/Use Mapping Information 160

FIG. 8 is a diagram indicating an example of state/use mappinginformation 160 explained in FIG. 4 and FIG. 5. For example, thestate/use mapping information 160 depicted in FIG. 8 has correspondenceinformation between information of deviation item 1-N and the use.

The deviation item depicted in FIG. 8 indicates the item where the stateinformation of the disk device “dd” where trouble sign was detecteddeviates from the distribution range of the state information calculatedby the state information of other disk devices “dd”. In addition, theuse depicted in FIG. 8 indicates the handling method for the disk device“dd” where the trouble sign was detected.

According to the first line information in the state/use mappinginformation 160, the handling corresponding to the deviation item“access load” is a handling “lowering the access load” (striping: twodisk devices).

When the access load is high, that is, when the access is concentratedin the disk device “dd” where the trouble sign was detected, it may bein a factor of the faulty of the disk device “dd”. Accordingly, bysetting the striping (RAID 0) that disperses the access for the diskdevice “dd” to the disk device “dd” where trouble sign was detected andone other disk device “dd”, it become able to restrain access load.

According to the second line information in the state/use mappinginformation 160, the handling corresponding to a deviation item “numberof bad sectors” is a handling “raise redundant degree (mirroring)”. Thetrouble is more likely to occur in the disk device “dd” when the numberof bad sectors is large. Therefore, the redundancy degree of dataincreases by setting the mirroring (RAID 1) for the disk device “dd”that trouble sign was detected and one other device. Therefore, it ispossible to prepare for the trouble of the disk device “dd”.

Similarly, according to the third line information, the handlingcorresponding to the deviation item “temperature” is a handling“lowering the access load” (striping: two disk devices) like thehandling corresponding to the deviation item “access load”. In addition,according to the fourth line information, the handling of the deviationitem “operation time” is a handling “backup”. The handling “backup”indicates to change a use of the disk device “dd” where the trouble signwas detected for the backup.

According to the 11th line information in the state/use mappinginformation 160, the handling corresponding to the deviation item“access load” and the deviation item “temperature” are a handling“lowering the access load” (striping: three disk devices). When thetemperature is high in addition to access load, by more increasing thenumber of disk devices “dd” which is dispersed data, as compared withthe case only access load is high (first line), it is possible suppressthe access load more. Thereby it is possible to control a rise in thetemperature.

In addition, according to the 20th line information in the state/usemapping information 160, the handling when there is not a deviation itemis the handling “continuation use”. When there is not a deviation item,it indicates that an abnormal state does not occur in the stateinformation of the disk device “dd” where the trouble sign was detected.The handling “use of continuation” indicates to continue use withoutchanging the use or the setting of the redundant degree of the diskdevice “dd” where the trouble sign was detected.

Furthermore, according to the 21th line information in the state/usemapping information 160, the handling when all are the deviation item isa handling “separating”. When all items of the state informationdeviates from the distribution range of the state information calculatedby the state information of other disk devices “dd”, the faulty hasalready occurred in the disk device “dd”, and data is more likely to belost. Therefore, by separating the disk device “dd” where the troublesign was detected, it is possible to protect memorized data and torestrain influence of the trouble on other disk device “dd”.

Then, according to FIG. 9, the details of the processing of abnormalityhandling decision program 140 depicted in FIG. 4 and FIG. 5 will bedescribed. In addition, according to FIG. 10-FIG. 13, an example of theprocessing depicted in the flow chart of FIG. 9 will be described.

Flow of Processing of Abnormality Handling Decision Program 140

FIG. 9 is a diagram of flow chart explaining a flow of the processing ofabnormality handling decision program 140 explained in FIG. 4 and FIG.5.

S31: Depending on the detection of the trouble sign, the statemonitoring module 141 acquires the state information of each item of aplurality of disk devices “dd” including the disk device ID that thetrouble sign was detected and memorizes it to the state informationtable 150 (referring to FIG. 6). For example, when detecting the troublesign of disk device “dd2” of disk ID “0002”, the state monitoring module141 acquires the state information of the disk devices “dd1”-“ddn” ofthe disk ID “0001” . . . “000N” and memorizes it to the stateinformation table 150. In addition, the state monitoring module 141notifies the use choice module 142 of detection of the trouble sign.

S32: The use choice module 142 refers to the state information table 150(referring to FIG. 6) in response to the notice and calculates an teststatistic (also called as deviation level) based on the stateinformation of disk devices “dd1”-“ddn” which are acquired with item byitem. The deviation level indicates the degree that state information ofdisk device “dd2” where the trouble sign was detected deviates from thedistribution range of the state information calculated by stateinformation of other disk devices “dd1”, “dd3”-“ddn”. The details of thecalculation processing of the deviation level will be mentioned lateraccording to FIG. 10-FIG. 13.

S33: The use choice module 142 identifies the item which the teststatistic (deviation level) has a biggest among each item which arerepresented in the state information table 150 (referring to FIG. 6).

S34: The use choice module 142 judges whether the biggest test statistic(deviation level) which is identified in the process S33 is more than avalue “α”. The “α” level according to the embodiment is a value “4.0”.But the “α” level is not a thing limited to this example. For example,the “α” level is set according to the inspection, etc.

S35: When the test statistic is more than the value “α” (Yes of S34),the use choice module 142 compares the test statistic where the teststatistic which is identified is the biggest and the test statics ofother each item except the item of biggest test statistic. And, the usechoice module 142 identifies the item having the test statistics thatthe difference between the biggest test statistics is within value “β”.

The “β” level according to the embodiment is a value “1.0”. But the “β”level is not a thing limited to this example. The “β” level, as same asthe α level, is set according to the inspection, for example.

S36: The use choice module 142 judges whether or not an item having thedifference value within the value “β” which is identified in the processS35 exists.

S37: When there is not an item having the difference within the value“β” (No of S36), the use choice module 142 identifies the item of thesingle substance as the deviation item. In other words, the use choicemodule 142 identifies the item having the biggest test statistics as thedeviation item.

S38: When there is not an item having the difference within the value“β” (Yes of S36), the use choice module 142 identifies whether an itemwithin value “β” is some items.

S39: When the item having the difference within value “β” is some items(Yes of S38), the use choice module 142 identifies a plurality of itemsas the deviation items. In other words, the use choice module 142identifies the item where the test statistic is the biggest and someitems having the difference within value “β” as the deviation items.

In this way, the abnormality handling decision program 140 calculatesthe deviation level indicating the degree that the state information ofthe first device (the disk device that trouble sign was detected)deviates from the distribution range of the state information calculatedby the state information of the device except the first device for everyappointed item. And, the abnormality handling decision program 140judges that a first item having the biggest deviation value and a seconditem having the difference of the deviation level between the deviationlevel of the first item within the standard level (β level) deviateamong a plurality of items.

Thereby, it is possible that the abnormality handling decision program140 according to the embodiment identifies the item where an abnormalstate appears most conspicuously by selecting an item having a maximumdeviation level among a plurality of items. In addition, it is possiblethat the abnormality handling decision program 140 detects other itemsin conjunction with the item where an abnormal state appears mostconspicuously, based on the difference of the deviation level with theitem where the deviation level is the biggest.

The normal value ranges of the state information are different dependingon the item. Accordingly, the threshold to judge the deviation level isdifferent item by item. Therefore, it needs a man-hour and is not easyto set the threshold to judge whether the deviation level deviates itemby item.

In contrast, it is possible that the abnormality handling decisionprogram 140 according to the embodiment identifies a plurality of itemswhere abnormality appears in the state information effectively, based onthe comparison of the deviation level between the items. Thereby, it ispossible that the abnormality handling decision program 140 judgeseasily one or more deviation items appropriately without making thethreshold of the deviation level item by item.

S40: When item having the difference within value “β” is all items (Noof S38), the use choice module 142 identifies all items as the deviationitems.

S41: On the other hand, when the test statistic is less than the value“α” (No of S34), that is, when the test statistic of any items does notreach the value “α”, the use choice module 142 judges the stateinformation during the past appointed period. That is, the use choicemodule 142 identifies the deviation item based on the past stateinformation in the period counted back to the past for a scheduledperiod since the trouble sign was detected to be concrete.

An abnormal state may occur irregularly, because of the state of diskdevice “dd”. Therefore, an abnormal state may not appear in the stateinformation when trouble sign was detected. In contrast, the use choicemodule 142 according to the embodiment acquires the state information ofeach of a plurality of disk device “dd” during the period when goingback a scheduled period to since the abnormal information (trouble sign)was detected. And the use choice module 142 judges whether the stateinformation of the first device (disk device “dd” where trouble sign wasdetected) deviates from the distribution range of the state informationcalculated by the state information of the device except the firstdevice for every appointed items.

Thereby, it is possible that the abnormality handling decision program140 appropriately judges the item that an abnormal state produces bybeing based on the state information in the past appointed period, evenif the abnormality of the state information produces irregularly.Therefore, it is possible that the abnormality handling decision program140 decides appropriate handling depending on the state of disk device“dd”.

Or, although not illustrated in figures, but the use choice module 142may judge that there is not a deviation item when the test statistic isless than value “α” (No of S34).

S42: The use choice module 142 decides the handling depending on thejudgment result with reference to the state/use mapping information 160(referring to FIG. 8).

In this way, the abnormality handling decision program 140 judgeswhether the state information of disk device “dd” where trouble sign wasdetected deviates from the distribution range of the state informationcalculated by the state information of other disk device “dd”. And theabnormality handling decision program 140 decides appropriate handlingfor the disk device “dd” where trouble sign was detected based on thejudgment result.

That is, the use choice module 142 decides the handling corresponding tothe item concerned when there is an item that the state information ofdisk device “dd” where trouble sign was detected deviates from thedistribution range of the state information calculated by the stateinformation of other disk device “dd”. Thereby, it is possible that theuse choice module 142 decides appropriate handling depending on an itemof one or more state information which appear an abnormal state.

In addition, the use choice module 142 decides the handling indicatingcontinuously using of the disk device “dd” where trouble sign wasdetected, when there is not the item which deviates. Thereby, the usechoice module 142 decides the handling of continuously using which doesnot change the disk device “dd” where trouble sign was detected, whenthere is not an item of the state information which appears an abnormalstate.

S43: The use choice module 142 requests the constitution administrationmodule 132 for execution of the handling such as the change of theredundant degree or the change of the use which is decided. Theconstitution administration module 132 realizes the handling such as thechange of the redundant degree or the change of the use depending on therequest through the disk interface unit 104.

In addition, in the embodiment, a plurality of disk devices “dd”included in the system are devices that a distribution range of thestate information calculated by the state information in the normalcy isthe same. Therefore, it is possible to judge appropriately whether anabnormal state occurs in the disk device “dd” where trouble sign wasdetected by comparing the state information of the disk device “dd”where trouble sign was detected with the state information of other diskdevice “dd”. In addition, the use choice module 142 does not have tomake the standard value to judge the state information item by item.

FIRST EXAMPLE

FIG. 10 is a diagram explaining the first example of the processing inflow chart of FIG. 9. FIG. 10 exemplifies a case that trouble signoccurred in the disk device “dd22 of disk ID “0002”. As mentioned above,the use choice module 142 calculates a deviation level indicating thedegree that the state information of disk ID “0002” deviates from thedistribution range of the state information calculated by the stateinformation of the other disk ID “0001”, “0003” . . . “000N” item byitem.

For example, the use choice module 142 calculates a deviation value(test statistic) according to a formula “(“the value of the stateinformation of the disk device ID that trouble sign wasdetected”—“average of the state information”)/“standard deviation of thestate information”” item by item. The average of the state informationindicates a value of the average of the state information of the disk ID“0001” . . . “000N”. In addition, the standard deviation of the stateinformation is the square root of dispersion. The dispersion indicatesthe value that divided the value, that the square of the differencesbetween the value of the state information of each disk ID and theaverage are added sequentially, is divided according to the number ofthe disk IDs.

In this way, the use choice module 142 calculates the calculation levelthat is divided the deviation of the value of the state information ofthe target item by the standard deviation as the deviation level.Therefore, it is possible that the use choice module 142 calculates thedegree that the value of the state information of disk ID “0002”deviates from the distribution range of the value of the stateinformation of the other disk ID “0001”, “0003” . . . “000N” about thetarget item.

Thereby it is possible that the abnormality handling decision program140 easily judges whether the abnormality occurs in the stateinformation of each item of the disk device “dd2” where trouble sign wasdetected based on the deviation level. In addition, the calculationmethod of the deviation level is not a thing limited to this example.For example, the use choice module 142 may calculate a deviation levelbased on the expression “(the value of the state information of the diskdevice ID that trouble sign was detected)—(the average of the stateinformation)”.

FIG. 10 exemplifies the case that only item “access load” deviates fromthe distribution range of the state information calculated based on thestate information of other disk ID “0001”, “0003” . . . “000N”.According to the example of FIG. 10, the deviation level of the item“access load” is a value “4.52”. In other words, the degree, that thevalue “12443” of the access load of disk ID “0002” deviates from thedistribution range of the access load of the other disk ID “0001”,“0003” . . . “000N”, is the value “4.52”.

In addition, the deviation level of the item “number of bad sectors” isa value “0.44”. In other words, the degree, that number of bad sectors“2” of disk device ID “0002” deviates from the distribution range of thenumber of bad sectors of the other disk ID “0001”, “0003” . . . “000N”,is the value “0.44”. Similarly, the deviation level of the item“temperature” is a value “0.75” and a deviation level of the item“operation time” is a value “0.72”.

Therefore, the use choice module 142 identifies the item “access load”as the item where a deviation value (test statistic) is the biggest (S33of FIG. 9). In addition, the deviation value “4.52” of the item “accessload” is bigger than the value “4.0” (α level) (Yes of S34). Therefore,the use choice module 142 judges whether there is the item thatdifference from the deviation level of the item “access load” is withinthe value “1.0” (β level) (S36).

In an example of FIG. 10, the item, where the difference of thedeviation level from the item “access load” is within the value “1.0”,does not exist (No of S36). Therefore, the use choice module 142identifies only item “access load” as the deviation item (S37). And theuse choice module 142 refers to the state/use mapping information 160(referring to FIG. 8) and decides the handling “lowering the access load(striping: two disk devices)” corresponding to deviation item “accessload”.

And the use choice module 142 requests the constitution administrationmodule 132 for the control that the writing of data for the disk device“dd2” distributes to the disk device “dd2” and one other disk device“dd”. Thereby, it is possible to control the access load because theaccess for the disk device “dd2” is distributed.

In this way, it is possible that the abnormality handling decisionprogram 140 according to the embodiment decides the handlingappropriately corresponding to the item where a distribution range ofthe state information deviates from other disk device “dd”. Therefore,it is possible that the abnormality handling decision program 140decides appropriate handling corresponding to the state of disk device“dd” where trouble sign was detected.

SECOND EXAMPLE

FIG. 11 is a diagram explaining the second example of the processing offlow chart in FIG. 9. FIG. 11, as same as an example of FIG. 10,exemplifies the case that trouble sign produced in the disk device “dd2”of disk ID “0002”.

In the state information depicted in FIG. 11 a value of the item“temperature” is different from the state information indicating theexample of FIG. 10. Therefore, according to the example of FIG. 11, thedeviation levels of the item “access load”, the item “number of badsectors” and the item “operation time” are similar to an example of FIG.10. On the other hand, the deviation level of the item “temperature” isa value “4.18” unlike an example of FIG. 10.

FIG. 11 exemplifies the case that the item “access load” and the item“temperature” deviate from the distribution range of the stateinformation calculated by the state information of other disk ID “0001”,“0003” . . . “000N”. As same as an example of FIG. 10, the use choicemodule 142 identifies the item “access load” as the item of biggestdeviation level (S33 of FIG. 9).

In addition, in an example of FIG. 11, the differences of the deviationlevel of the item “temperature” and deviation level of the item “accessload” are within a value “1.0” (β level) (Yes of S36). Therefore, theuse choice module 142 identifies the item “temperature” and the item“access load” as the deviation item (S39).

In this way, it is possible that the abnormality handling decisionprogram 140 according to the embodiment identifies the plurality ofitems where the abnormality appear based on the difference of thedeviation level from the item with the biggest deviation level, withoutestablishing the threshold value by item.

In addition, as represented by the example of FIG. 11, it is possible toidentify the item that the deviation level is less than the α level asthe deviation item by setting a β level in a small value for an α level.For example, when the deviation level of the item “temperature” is avalue “3.5”, the difference of the deviation level with the item “accessload” is within the value “1.0” (β level), although the deviation levelis not satisfied with the α level. In this case, it is possible that theabnormality handling decision program 140 determines the item“temperature” as the deviation item.

In this way, when other one or more items except the item, that thedeviation level has a biggest, indicate that the deviation degree is notremarkable but appears the abnormal value, it is possible to judge theother item as the deviation item. Therefore, it is possible that theabnormality handling decision program 140 appropriately judges theplurality of items indicating the abnormal state among items of thestate information of the disk device “dd2” where trouble sign wasdetected.

And the use choice module 142 refers to the state/use mappinginformation 160 and decides the handling for the disk ID “0002” based ona combination of the item “access load” and the item “temperature”(S42). Especially, the use choice module 142 decides the handling“lowering the access load (striping: three disk devices)” correspondingto the combination of the item “access load” and the item “temperature”.

According to the example of FIG. 11, in the disk ID “0002”, the stateinformation of the item “temperature” in addition to the item “accessload” deviated from the distribution range of other disk device ID“0001”, “0003” . . . “000N”. As described in FIG. 8, when the item“temperature” is more identified as the deviation item, there is muchdisk device “dd” for distributing the data for an example of FIG. 10.

The use choice module 142 requests the constitution administrationmodule 132 for the control which distributes the writing of data for thedisk ID “0002” to the disk device “dd2” and two other disk devices “dd”(S43). It is possible to control the access load more and control a risein the temperature because the access for disk device dd2 is distributedmore.

In this way, it is possible that the abnormality handling decisionprogram 140 according to the embodiment decides the handlingappropriately corresponding to the item where a distribution range ofthe state information deviates from other disk device “dd”. Therefore,it is possible that the abnormality handling decision program 140decides appropriate handling corresponding to the state of disk device“dd” where trouble sign was detected.

THIRD EXAMPLE

FIG. 12 is a diagram explaining the third example of the processing inflow chart of FIG. 9. FIG. 12, as same as an example of FIG. 10,exemplifies the case that trouble sign produced in the disk device “dd2”of disk ID “0002”.

In the state information depicted in FIG. 12, a value of the item“number of bad sectors”, a value of the item “temperature” and a valueof the item “operation time” are different from the state informationindicating the example of FIG. 10. Therefore, according to the exampleof FIG. 12, the deviation level of the item “access load” is similar toan example of FIG. 10. On the other hand, the deviation levels of theitem “number of bad sectors” and the item “temperature” and the item“operation time” are different from an example of FIG. 10.

According to the example of FIG. 12, the deviation level of the item“number of bad sectors” is a value “4.13”, the deviation level of theitem “temperature” is a value “4.18” and the deviation level of the item“operation time” is a value “3.53”. As same as an example of FIG. 10,the use choice module 142 identifies the item “access load” as the itemof biggest deviation level (S33 of FIG. 9).

In addition, in an example of FIG. 12, the differences between thedeviation levels of the item “number of bad sectors”, of the item“temperature” and of the item “operation time” and the deviation levelof the item “access load” are within a value “1.0” (β level) (Yes ofS36). Therefore, the use choice module 142 identifies all the items asthe deviation item (S40).

And the use choice module 142 refers to the state/use mappinginformation 160 and decides the handling “separating” of the disk ID“0002” based on the deviation item (S42). When all items are deviationitems, trouble is more likely to really occur in the disk device “dd” ofdisk ID “0002”. Therefore, the use choice module 142 requests theconstitution administration module 132 for control of separation of thedisk ID “0002” (S43).

Thereby, it is possible to suppress the influence on other disk devices“dd1”, “dd3”-“ddn” while avoiding a loosing of data memorized in thedisk device “dd2” of disk ID “0002”. In this way, it is possible thatthe abnormality handling decision program 140 decides appropriatehandling corresponding to the state of disk device “dd” where troublesign was detected.

FOURTH EXAMPLE

FIG. 13 is a diagram explaining the fourth example of the processing offlow chart in FIG. 9. FIG. 13, as same as an example of FIG. 10,exemplifies the case that trouble sign produced in the disk device dd2of disk ID “0002”.

In the state information of FIG. 13, a value of the item “access load”is different from the state information in example of FIG. 10 and adeviation level of the item “access load” is different from an exampleof FIG. 10. In the example of FIG. 13, the deviation level of the item“access load” is a value “1.83”, the deviation level of the item “numberof the bad sectors” is a value “0.44”, the deviation level of the item“temperature” is a value “0.75”, and the deviation level of the item“operation time” is a value “0.72”.

As same as an example of FIG. 10, the use choice module 142 identifiesthe item “access load” as item having the biggest deviation level (S33of FIG. 9). But the deviation level “1.83” of the item “access load” isless than value “4.0” (No of S34).

As illustrated by an illustration of flow chart of FIG. 9, the usechoice module 142 judges that there is not a deviation item, forexample. In this case, the use choice module 142 decides the handling“use of continuation” as the handling of disk ID “0002” with referenceto the state/use mapping information 160 (S42). And the use choicemodule 142 notifies, for example, constitution administration module 132of continuously using of the disk ID “0002” (S43).

In this way, it is possible that the abnormality handling decisionprogram 140 according to the embodiment decides the handling ofcontinuously using of the disk device “dd2” when an abnormal state doesnot really occur in the disk device “dd2” where trouble sign wasdetected even if the trouble sign is detected. Thereby, it is possiblethat the abnormality handling decision program 140 decides appropriatehandling corresponding to the real state of disk device “dd2” wheretrouble sign was detected, even if the trouble sign is detected.Therefore, because the exchange frequency of disk device dd2 decreases,it is possible to suppress the increase of the cost.

Or, as illustrated by an illustration of flow chart of FIG. 9, the usechoice module 142 identifies a deviation item based on the stateinformation during the period counted back to the past during ascheduled period (S41). The processing of process S41 will be mentionedlater according to a flow chart of FIG. 14.

Flow of Processing of Abnormality Handling Decision Program 140

FIG. 14 is a diagram of flow chart explaining the details of theprocessing of process S41 in flow chart of FIG. 9. The processing ofprocess S55—process S62 depicted in FIG. 14 is similar to that ofprocess S33—process S40 in the flow chart of FIG. 9.

S51: The use choice module 142 sets the value “T trouble sign time” atthe value “T calculation time”. In other words, the use choice module142 sets the time when trouble sign was detected at the value “Tcalculation time”.

S52: The use choice module 142 sets the value that subtracted the value“t acquisition time” from the value “T calculation time” to the value “Tcalculation time”. In other words, the use choice module 142 decrementsthe value “T calculation time” by the value “t acquisition time”.

S53: The use choice module 142 judges whether the values that subtractedthe value “T calculation time” from the value “T trouble sign time”exceeds a value “t effective calculation time”. In other words, the usechoice module 142 judges whether a difference with the value “T troublesign time” and the value “T calculation time” falls into “t effectivecalculation time”.

Here, the value “t effective calculation time” and the value “tacquisition time” will be described according to FIG. 15.

t EFFECTIVE CALCULATION TIME, t ACQUISITION TIME

FIG. 15 is a diagram explaining value “t effective calculation time” andvalue “t acquisition time”. The “t acquisition time” indicates timeinterval to acquire the state information. As mentioned above, the “tacquisition time” according to the embodiment is ten minutes. Inaddition, the value “T calculation time” is the value that decrementedthe value “T trouble sign time” sequentially for value “t acquisitiontime”.

In addition, the “t effective calculation time” indicates the targetperiod which is judged the state information back to the past time from“the T trouble sign time” that the trouble sign was detected. In otherwords, the “t effective calculation time” indicates the period when thejudging of the state information counted back to the past time from “theT trouble sign time” is effective. The explanation will be continued byreturning to a flow chart of FIG. 14.

S54: When the difference is not over the value “t effective calculationtime” (No of S53), the use choice module 142 acquires state informationof all disk devices “dd1”-“ddn” at the value “T calculation time” andmemorize it to the state information table 150. And the use choicemodule 142 acquires the state information of disk devices “dd1”-“ddn” atthe value “T calculation time”, with reference to the state informationtable 150, and calculates the test statistic (deviation level), item byitem.

In other words, the use choice module 142 calculates the test statistic(deviation level) based on the state information at “the T calculationtime” when the difference with the value “T trouble sign time” fallsinto the “t effective calculation time”, for every items. On the otherhand, the use choice module 142 excludes the state information at thevalue “T calculation time” when the value “T calculation time” goes backmore than “t effective calculation time” from the value “T trouble signtime”, from the judgment target.

S55: Following process S54, the use choice module 142 identifies theitem which the test statistic which is calculated in process S54 has abiggest among a plurality of items.

S56: The use choice module 142 judges whether the biggest test statisticwhich is identified in the process S55 is more than the value “α”. Whenthe test statistic is less than the value “α” (No of S56), the usechoice module 142 moves to processing of process S52.

S57: When the test statistic is more than the value “α” (Yes of S56),the use choice module 142 compares the test statistic between the itemhaving a biggest test statistics and each other items. The use choicemodule 142 identifies other items that the differences of the teststatistic between the test statistics of the item that the teststatistic has a biggest are within the value “β”.

S58: The use choice module 142 judges there is an item that thedifference is within the value “β”.

S59: When there is not an item that the difference is within the value“β” (No of S58), the use choice module 142 judges the item of the simplesubstance as the deviation item.

S60: When there is an item that the difference is within the value “β”(Yes of S58), the use choice module 142 judges whether items that thedifference is within value “β” is some items.

S61: When an item that the difference is within value “β” is some items(Yes of S60), the use choice module 142 judges a plurality of items asthe deviation item.

S62: On the other hand, when an item that the difference is within value“β” is all items (No of S60), the use choice module 142 judges all itemsas the deviation item.

S63: when the difference is beyond value “t effective calculation time”(YES of S53), the use choice module 142 determines there is no deviationitem. In other words, the use choice module 142 determines that thedeviation item does not exist.

In this way, it is possible that the use choice module 142 decidesappropriate handling depending on a state of disk device “dd” by beingbased on the state information during the period that it is effective tojudge the state information of day counted back to the past time from“the T trouble sign time” that the trouble sign was detected. Therefore,it is possible that the use choice module 142 decides appropriatehandling according to a state of disk device “dd” even if theabnormality of the state information produces irregularly.

Other Embodiment

In addition, the embodiment described above exemplified an example ofhandling for the detection of the trouble sign in the disk device group105 of the simple substance. But the embodiment is effective for thedisk device “dd” included in a plurality of disk device groups. Thesystem according to the other embodiment has a plurality of disk devicegroups including a plurality of disk devices “dd” each.

The disk device group that a distribution range of the state informationcalculated by the state information in the normalcy is the sameindicates the plurality of disk device groups that access frequency andduties to carry out are similar. In this case, it is possible tointegrate disk devices “dd” in the plurality of disk device groups asthe target disk device “dd” that the state information is compared withthe disk device “dd” where trouble sign was detected. Therefore, becausethe number of disk device “dd” for the comparison increases, it ispossible to improve the judgment precision of the state information.

Storage System in Other Embodiment

FIG. 16 is a diagram indicating an example of the storage systemaccording to the other embodiment. The storage system depicted in FIG.16 has an administration device 200 and two storage devices 300 a, 300b. Each of the storage devices 300 a, 300 b is connected to theadministration device 200 through the network, etc. The storage device300 a has a disk device group 105 a, and the storage device 300 b has adisk device group 105 b.

The administration device 200 decides the handling for the disk device“dd” where trouble sign was detected depending on the detection of thetrouble sign of the disk device (also called as “dd1a”-“ddnb”, diskdevice “dd”) in each of disk device groups 105 a, 105 b, and notifies itto the storage device 300 a, 300 b.

As illustrated in FIG. 16, in the plurality of disk device groups 105 b,105 b, the total number of the disk devices “dd” is large. Therefore, itis possible that the abnormality handling decision program 140 judges anitem of the state information indicating the abnormal state more highlyprecise by being based on a comparison of the state information withlarge number of the disk devices “dd”.

Hardware Constitution of Administration Device 200

FIG. 17 is a diagram indicating an example of the hardware constitutionof administration device 200 depicted in FIG. 16. The administrationdevice 200 has a CPU 201, a memory 202 including a main memory 210 andan auxiliary memory 211, etc., a communication interface unit 203, andan external interface unit 204. The all parts are connected through abus 206 mutually.

The CPU 201 is connected to the memory 202, etc. through the bus 206 andcontrols the whole administration device 200. The communicationinterface unit 203 is connected to other apparatuses (not illustrated inFIG. 17) through the Internet and transmits and receives data.

The main memory 210 such as a RAM (Random Access Memory) memorizes thedata that the CPU 201 processes. The auxiliary memory 211 includes anHDD, or a nonvolatile semiconductor memory. The auxiliary memory 211 hasdomains (not illustrated in FIG. 17) storing the program of theoperation system that the CPU 201 carries out. In addition, theauxiliary memory 211 has an abnormality handling decision programstorage domain 240, a state information table storage domain 250, and astate/use mapping information storage domain 260.

The abnormal handling decision program (below called as abnormalityhandling decision program 240) in the abnormality handling decisionprogram storage domain 240 realizes decision processing of the handlingfor the disk device “dd” depending on the detection of the trouble signof disk device “dd” by execution of the CPU 201.

A state information table (called as state information table 250 asfollows) in the state information table storage domain 250 is the tablethat the abnormality handling decision program 240 accesses and hasstate information of disk device dd in each disk device groups 150 a,150 b. The details of the state information table 250 will be mentionedlater according to FIG. 18. The state/use mapping information (belowcalled as state/use mapping information 260) in the state/use mappinginformation storage domain 260 is similar to that in FIG. 8.

Software Block of Storage System

FIG. 18 is a diagram indicating an example of the software block of thestorage system depicted in FIG. 16. The storage control program 310 a inthe storage device 300 a has a data processing module 311 a, a statemonitoring module 312 a, and a constitution administration module 313 a.In addition, the storage control program 310 b in the storage device 300b has a similar module, too.

The data processing module 311 a instructs the access processing for thedisk device “dd” to the disk interface unit 304 a in response to acommand to order the access of data which is sent by other devices.

The state monitoring module 312 a acquires the state information of eachdisk device “dd” in the disk device group 105 a regularly, and transmitsit to the administration device 200. In addition, the state monitoringmodule 312 a sends the state information of each disk device “dd” to theadministration device 200 depending on the detection of the trouble signof the disk device “dd” in the disk device group 105 a and notifies theadministration device 200 and the storage device 300 b of outbreak ofthe trouble sign. In addition, the state monitoring module 312 a sendsthe state information of each disk device “dd” to the administrationdevice 200 in response to the notice of the detection of the troublesign of other storage device 300 b.

In addition, the constitution administration module 313 a manages theinformation about the constitution of each disk device “dd” in the diskdevice group 105 a. In addition, the constitution administration module313 a receives the handling for the disk device “dd” where trouble signwas detected by the administration device 200 and carries out thehandling such as the change of the redundant degree of the RAID or theuse change of disk device “dd”.

In addition, the abnormality handling decision program 240 in theadministration device 200 has a use choice module 241. When the usechoice module 241 receives a notice indicating the detection of thetrouble sign from the storage device 300 a, 300 b and the stateinformation of each disk device “dd” in a plurality of disk devicegroups 105 a, 105 b, the use choice module 241 memorizes it to the stateinformation table 250.

And the use choice module 241 judges whether the state information ofthe disk device “dd” where trouble sign was detected deviates from thedistribution range of the state information calculated by the stateinformation of disk devices “dd” in the plurality of disk device groups105 a, 105 b, for every item. In addition, the use choice module 241decides the handling for the disk device “dd” where the trouble sign wasdetected with reference to the state/use mapping information 260 basedon the judgment result.

And the use choice module 241 notifies the storage device 300 a, 300 bhaving the disk device “dd” where the trouble sign was detected of thedecided handling.

State Information Table 250

FIG. 19 is a diagram indicating an example of state information table250 according to the other embodiment. The state information table 250depicted in FIG. 19 has device ID to distinguish the disk device group105 a, 105 b in addition to the information of state information table150 (referring to FIG. 6). As illustrated in FIG. 19, the stateinformation table 250 has state information of a plurality of diskdevice groups 105 a, 105 b.

The processing of the abnormality handling decision program 240 depictedin FIG. 17 and FIG. 18 is same as the processing indicated in the flowchart in FIG. 9, FIG. 14. In other words, the abnormality handlingdecision program 240 judges whether the state information of the diskdevice that the trouble sign was detected deviates from the distributionrange of the state information calculated by the state information ofthe devices except the disk device that the trouble sign was detected inthe plurality of disk device groups 105 a, 105 b. Thereby, the number ofdisk devices “dd” for the comparison target increases, therefore thejudgment precision of the state information improves.

All examples and conditional language provided herein are intended forthe pedagogical purposes of aiding the reader in understanding theinvention and the concepts contributed by the inventor to further theart, and are not to be construed as limitations to such specificallyrecited examples and conditions, nor does the organization of suchexamples in the specification relate to a showing of the superiority andinferiority of the invention. Although one or more embodiments of thepresent invention have been described in detail, it should be understoodthat the various changes, substitutions, and alterations could be madehereto without departing from the spirit and scope of the invention.

What is claimed is:
 1. A non-transitory computer readable storage mediumstoring therein an abnormality handling determination program thatcauses a computer to execute a process, the process comprising:acquiring state information of appointed items regarding to a state ofeach of a plurality of devices in a system depending on a detection ofabnormal information of a first device among the plurality of devices;judging whether the state information of the first device deviates fromthe distribution range of the state information calculated by the stateinformation of the devices except the first device for every item; anddetermining a handling for the first device based on a result of thejudgment.
 2. The storage medium according to claim 1, wherein thedetermining comprises determining the handling corresponding to anappointed item which deviates when detecting the appointed item whichdeviates.
 3. The storage medium according to claim 1, wherein thedetermining comprises determining the handling of continuous use of thefirst device when not detecting the appointed item which deviates. 4.The storage medium according to claim 1, wherein the plurality ofdevices included in the system has same distribution range of the stateinformation calculated by the state information when normal.
 5. Thestorage medium according to claim 1, wherein the judging comprises:calculating deviation levels indicating a deviation degree that thestate information of the first device deviates from the distributionrange of the state information calculated by the state information ofthe devices except the first device for every item; and judging a firstitem that the deviation level is greatest among a plurality of items,and a second item that a difference of the deviation value from thefirst item is within a standard value, as item that deviates.
 6. Thestorage medium according to claim 1, wherein the judging furthercomprises: acquiring the state information of each of the plurality ofdevices during a period counted back to the past for a scheduled periodsince the abnormal information was detected; and judging whether theacquired state information of the first device deviates from thedistribution range of the state information calculated by the acquiredstate information of the devices except the first device for every item.7. The storage medium according to claim 1, wherein the acquiringcomprises acquiring the state information of appointed items regardingto the state of each of the plurality of storage devices.
 8. The storagemedium according to claim 1, wherein the acquiring comprises acquiringthe state information of appointed items, which is either one orcombination of an access load for the device, a temperature of thedevice, the number of abnormal parts in the device and an operation timeof the device, regarding to the state of each of the plurality ofdevices.
 9. The storage medium according to claim 1, wherein theacquiring comprises acquiring the state information of appointed itemsregarding to the state of each of the plurality of storage devices in aplurality of disk device groups.
 10. A method for abnormality handlingdetermination, the method comprising: acquiring, by a processor, stateinformation of appointed items regarding to a state of each of aplurality of devices in a system depending on a detection of abnormalinformation of a first device among the plurality of devices; judging,by the processor, whether the state information of the first devicedeviates from the distribution range of the state information calculatedby the state information of the devices except the first device forevery item; and determining, by the processor, a handling for the firstdevice based on a result of the judgment.
 11. The method according toclaim 10, wherein the determining comprises determining the handlingcorresponding to an appointed item which deviates when detecting theappointed item which deviates.
 12. The method according to claim 10,wherein the determining comprises determining the handling of continuoususe of the first device when not detecting the appointed item whichdeviates.
 13. The method according to claim 10, wherein the plurality ofdevices included in the system has same distribution range of the stateinformation calculated by the state information when normal.
 14. Themethod according to claim 10, wherein the judging comprises: calculatingdeviation levels indicating a deviation degree that the stateinformation of the first device deviates from the distribution range ofthe state information calculated by the state information of the devicesexcept the first device for every item; and judging a first item thatthe deviation level is greatest among a plurality of items and a seconditem that a difference of the deviation value with the first item iswithin a standard value as item that deviates.
 15. The method accordingto claim 10, wherein the judging further comprises: acquiring the stateinformation of each of the plurality of devices during a period countedback to the past for a scheduled period since the abnormal informationwas detected; and judging whether the acquired state information of thefirst device deviates from the distribution range of the stateinformation calculated by the acquired state information of the devicesexcept the first device for every item.
 16. The method according toclaim 10, wherein the acquiring comprises acquiring the stateinformation of appointed items, which is either one or combination of anaccess load for the device, a temperature of the device, the number ofabnormal parts in the device and an operation time of the device,regarding to the state of each of the plurality of devices.
 17. Anadministration device comprising: a processor configured to acquirestate information of appointed items regarding to a state of each of aplurality of devices in a system depending on a detection of an abnormalinformation of a first device among the plurality of devices, judgeswhether the state information of the first device deviates from thedistribution range of the state information calculated by the stateinformation of the devices except the first device for every item, anddetermines a handling for the first device based on a result of thejudgment; and a memory that stores the state information of theplurality of devices.
 18. The administration device according to claim17, wherein the processor determines the handling corresponding to anappointed item which deviates when detecting the appointed item whichdeviates.
 19. The administration device according to claim 17, whereinthe processor acquires the state information of appointed itemsregarding to the state of each of the plurality of storage devices. 20.The administration device according to claim 17, wherein the processoracquires the state information of appointed items regarding to the stateof each of the plurality of storage devices in a plurality of diskdevice groups.